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Abstract — Most communication systems use some form of 
feedback, often related to channel state information. The common 
models used in analyses either assume perfect channel state 
information at the receiver and/or noiseless state feedback links. 
However, in practical systems, neither is the channel estimate 
known perfectly at the receiver and nor is the feedback link 
perfect. In this paper, we study the achievable diversity multi- 
plexing tradeoff using i.i.d. Gaussian codebooks, considering the 
errors in training the receiver and the errors in the feedback 
link for FDD systems, where the forward and the feedback are 
independent MIMO channels. 

Our key result is that the maximum diversity order with one- 
bit of feedback information is identical to systems with more 
feedback bits. Thus, asymptotically in SNR, more than one 
bit of feedback does not improve the system performance at 
constant rates. Furthermore, the one-bit diversity-multiplexing 
performance is identical to the system which has perfect channel 
state information at the receiver along with noiseless feedback 
link. This achievability uses novel concepts of power controlled 
feedback and training, which naturally surface when we consider 
imperfect channel estimation and noisy feedback links. In the 
process of evaluating the proposed training and feedback proto- 
cols, we find an asymptotic expression for the joint probability 
of the SNR exponents of eigenvalues of the actual channel and 
the estimated channel which may be of independent interest. 

Index Terms — Channel state information, diversity multiplex- 
ing tradeoff, feedback, multiple access channel, outage probabil- 
ity, power-controlled, training. 



I. Introduction 

Channel state information at the transmitter has been well 
established to improve communication performance, mea- 
sured either as increased capacity (see e.g. [1-6]), improved 
diversity-multiplexing performance [7-18] or higher signal- 
to-noise ratio at the receiver [2, 19, 20] among many possible 
metrics. Several of the aforementioned works have considered 
the impact of incomplete channel knowledge at the trans- 
mitter, by considering quantized channel information at the 
transmitter which can be visualized to be made available by 
the receiver through a noiseless finite-capacity feedback Unk. 
While the model and subsequent analysis clearly shows that 
reduced channel information at the transmitter can lead to 
significant performance gains due to channel knowledge, a 
key requirement is that the receiver knows what the transmitter 
knows (even if there is an error in feedback link). In practice, 
the communicating nodes are distributed and have no way of 
aligning their channel knowledge perfectly. 
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In this paper, we systematically analyze the impact of 
mismatch in channel knowledge at the transmitter and receiver. 
For clarity of presentation, we will largely focus on a single- 
user point-to-point Unk (one transmitter and one receiver) and 
only at the end of the paper, extend the results to the case of 
multiple-access channels (many transmitters and one receiver). 
The key departure from prior work is that we expUcitly 
model both the forward and feedback links as fading wireless 
channels. A little thought immediately shows that all practical 
wireless networks have "two-way" communication links, that 
is, the nodes are transceivers such that all the received and 
transmitted packets (control, data, feedback) travel over noisy 
fading channels. The two-way model to analyze feedback was 
first proposed in [10] for TDD systems, where it was shown 
to enable accurate resource accounting of the feedback link 
resources (power and spectrum) and analyze the important 
case of transmitter and receiver channel knowledge mismatch. 
A key message from [10] was that in the general case when 
the transmitter and receiver knowledge is mismatched, both 
feedback and forward communication has to be jointly de- 
signed. The simple two-way training protocol proposed in [10] 
led to the concept of power-controlled training which enabled 
joint estimation of transmitter's intended power control and 
actual channel realization. We will continue the line of thought 
initiated in [10] for the case of FDD (Frequency Division 
Duplex) systems and more importantly, systematically show 
how mismatch between information at the transmitter and 
receiver impacts the overall system performance. 

For the case of FDD systems, we will continue to model 
the forward and feedback links as fading channels. However, 
unhke [10], the forward and feedback channel will be as- 
sumed to be completely uncorrelated since they use different 
frequency bands. In this case, the two-way training protocol 
proposed in [10] will not be applicable and we will use 
the quantized feedback model, in which the receiver sends 
a quantized version of its own channel information over the 
feedback hnk. 

Our main result is an achievable diversity-multiplexing 
tradeoff for an m x n MIMO (Multiple Input Multiple Output) 
system, with measured channel information at the receiver 
which is used to send quantized channel information via a 
noisy link using i.i.d. Gaussian codebooks. We show that, 
in the scope of modeling assumptions, diversity-multiplexing 
tradeoff with one-bit of noisy feedback with estimated CSIR 
(channel state information at receiver) is identical to diversity- 
multiplexing tradeoff with perfect CSIR and noiseless one-bit 
feedback. Even more importantly, more bits of feedback do 
not improve the maximum diversity order in the imperfect 



2 



system (estimated CSIR with noisy feedback) in contrast of the 
ideaUzed system where the maximum diversity order increases 
exponentially [8, 13, 17]. The conclusion holds for multiuser 
systems in general. 

The encouraging news from our analysis is two-fold. First, 
even noisy and mismatched information about the channel 
at both transmitter and receiver is sufficient to improve the 
whole diversity-multiplexing tradeoff when compared to the 
case with no feedback. Second, very few bits of feedback, in 
fact one-bit, is all one needs to build in a practical system. 
However, to achieve any diversity order gain over the no- 
feedback system, a straightforward application of the Genie- 
aided feedback analysis [8, 13, 17] fails. That is, conventional 
training followed by conventionally coded quantized feed- 
back signal leads to no improvement in diversity order. A 
combination of power-controlled training (much like in [10]) 
and the new concept of power-controlled feedback (proposed 
in this paper) appear essential for significant improvement 
in diversity-multiplexing gain compared to the no-feedback 
system. 

We will build our main result in three major steps. In 
the first step, we assume that the receiver knowledge of 
the channel is perfect and the feedback channel is error- 
prone. Thus, the transmitter information about the channel 
is potentially different from that sent by the receiver. In this 
case, the coding of the feedback signal becomes important. 
If the feedback information about the channel is sent using 
a codebook where each codeword has equal power, then the 
maximum diversity order is 2mn irrespective of the amount 
of feedback information. In contrast, 6-bit noiseless feedback 
leads to a maximum diversity order grows exponentially in 
number of bits [8, 13]. The reason for such dramatic decrease 
(from unbounded growth with b in the noiseless case to 
maximum 2mn in the noisy case) is that the transmitter 
cannot distinguish between rare channel states, especially 
those where it is supposed to send large power to overcome 
poor channel conditions. As a result, it becomes conservative 
in its power allocation and the gain from power control does 
not increase unboundedly as in the case of noiseless feedback. 
A careful examination of the outage events on the feedback 
path naturally points to unequal error protection in the form 
of power controlled feedback coding scheme, where the rare 
states are encoded with higher power codewords. With power 
controlled feedback, the diversity order can be improved to 
{mnf' + 2mn, a substantial increase compared to constant 
power feedback. 

As our second step, we assume that the channel knowledge 
at the receiver is imperfect but the feedback path is error-free. 
We first analyze the commonly used protocol setup, where the 
receiver is trained at the average available power SNR (assum- 
ing noise variance is one) to obtain a channel estimate and 
then a quantized version of the estimated channel is fed back 
to the transmitter. While both the transmitter and receiver have 
identical information, the error in receiver information leads 
to no gain in the diversity order compared to a no-feedback 
system. We take a cue from an earlier work in [10] and the 
power-controlled feedback mechanism, and propose a two- 
round training protocol, where the receiver is trained twice. 



First round training uses an average power SNR and then 
after obtaining the channel feedback, the receiver is trained 
again with a training power dependent on the channel estimate. 
This implies that in poor estimated channel conditions, second 
round training power is higher than the first and in good 
conditions, it is lower. The adaptation of training power is 
labeled power-controlled training and allows a substantial 
increase in maximum diversity order ((mn)^ -t-mn) compared 
to no-feedback case (mn). For general multiplexing gains, the 
achievable diversity order is identical to that obtained when the 
receiver knows perfect channel state information with 1 bit of 
perfect feedback in [13]. Further, additional bits of feedback do 
not help in improving the maximum diversity order at constant 
rates when the receiver is trained to obtain a channel estimate. 

Finally, we put the first two steps together to analyze the 
general case of noisy channel estimates with noisy feedback. 
The key conclusion is that the receiver channel estimate errors 
are the bottleneck in the maximum achievable diversity order. 
Thus, the maximum achievable diversity order at multiplexing 
gain r = is (mn)^ -|- mn, which is less than [mn)'^ + 
2mn obtained in the case of perfect receiver information. For 
general multiplexing, we get the same tradeoff as with 1 bit of 
perfect feedback and the receiver having perfect channel state 
information as in [13]. This diversity multiplexing tradeoff 
can be achieved with a single bit of noisy feedback from the 
receiver. 

The results on MIMO point-to-point channels are extended 
to the multiple access channel in which all the transmitters 
have m antennas each and the receiver has n antennas, and 
all the conclusions drawn earlier for the single user also holds 
in a multiple access system. The channel information for each 
transmitter is measured at the receiver and a global quantized 
feedback is sent from the receiver to all the transmitters. We 
use the similar combination of power control training and 
feedback as in single-user systems to achieve a diversity order 
of {mn)'^ +mn with a single bit of power-controlled feedback 
in the main case in which the errors in the channel estimate 
and the feedback link are accounted. 

We note that the improvement in diversity order is com- 
pletely dependent on our use of power control as adaptation 
mechanism. For example, in MIMO systems, feedback can be 
used for beamforming (e.g. [19, 20]). However, beamforming 
does not lead to any change the diversity order and only 
increases the receiver SNR by a constant amount. Since we 
have focused on the asymptotic regime, we do not consider 
schemes like beamforming which have identical diversity- 
multiplexing performance as a non-feedback based system. 
In this paper, we find new achievable schemes to improve 
the diversity multiplexing tradeoff which are better than the 
traditional approaches but do not claim globally optimality of 
these schemes. 

The rest of the paper is organized as follows. Section 
11 describes preliminaries on channel model and diversity 
multiplexing tradeoff. Section III summarizes the known re- 
sults for the case when the receiver knows perfect channel 
state information and the feedback is sent over a noiseless 
channel [13]. Section IV describes the diversity order when 
the receiver knows the channel perfectly while the transmitter 
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receives feedback on a noisy channel. Section V describes the 
diversity multiplexing tradeoff when the receiver is trained to 
get channel estimate while the feedback link to the transmitter 
is noiseless. Section VI consider both the above errors, i.e., it 
considers receiver estimating the channel and the feedback link 
is also noisy. Section Vn presents some numerical results. We 
consider the extension to multiple access channels in Section 
VIII. Section IX concludes the paper. 



II. Preliminaries 

A. Two-way Channel Model 

We will primarily focus on single-user multiple input-output 
channel with the transmitting node denoted by T and receiving 
node denoted by R. Later, we will extend the main result to 
the multiple access channel in Section VIE. For the single-user 
channel, we will assume that there are m transmit antennas 
at the source node and n receive antennas at the destination 
node, such that the input-output relation is given by 



Forward Channel 



J ^R:Y = HX + W, 



(1) 



where the elements of H and W are assumed to be i.i.d. with 
complex Normal distribution of zero mean and unit variance, 
CA''(0, 1). The matrices Y,H,X and W are of dimension 
n X Tcoh ,nxm,Tnx Tcoh and n x T^oh, respectively. Here Tcoh 
is coherence interval such that the channel H is fixed during a 
fading block of Tcoh consecutive channel uses, and statistically 
independent from one block to another. The transmitter is 
assumed to be power-limited, such that the long-term power 
is upper bounded, i.e, ^;^tracc(E [XX^]) < SNR. 

Since our focus will be studying feedback over noisy 
channels, we assume that the same multiple antennas at the 
transmitter and receiver are available to send feedback on an 
orthogonal frequency band. For the feedback path, the receiver 
will act as a transmitter and the transmitter as a receiver. As 
a result, the feedback source (which is destination for data 
bits) will have n transmit antennas and feedback destination 
(which is source of data bits) will be assumed to have m 
receive antennas. Furthermore, a block fading channel model 
is assumed 

R^T :Yf = HfXf + Wf, (2) 

where Hf is the MIMO fading channel for the feedback 
link, and the Wf is the additive noise at the receiver of 
the feedback; both are assumed to have i.i.d. CN{0, 1) el- 
ements. The feedback transmissions are also assumed to be 
power-limited with a long-term power constraint given by 



- trace (E 



) < SNR/. Without loss of generality, 
we will assume the case where the transmitter and receiver 
have symmetric resources, such that SNR = SNR/. 

We note that a phase-symmetric two-way channel model 
with H = HJ was studied earlier in [10,11]. The phase- 
symmetric two-way channel is a good model for slow-fading 
time division duplex (TDD) systems. On the other hand, the 
above SNR-symmetric (SNR = SNR/) model is weU-suited 
for symmetrically resourced FDD systems. 
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Fig. 1. Two-way MIMO channel. The forward and feedback channels use 
the same antennas. If the forward and feedback are frequency-division duplex 
(FDD), then H and H / are not equal. On the other hand, if the forward and 
feedback channels are time-division duplex (TDD), then H = Hj is often a 
reasonable assumption. 

B. Obtaining Channel State Information 

The two-way channel model allows two-way protocols, 
as depicted in Figure 2, where the transmitter and receiver 
can conduct multiple back-and-forth transactions to complete 
transmission of one codeword. The model was used in [10, 
11] to study the diversity-multiplexing performance of two- 
way training method. In this paper, we will focus on another 
instance of the two-way protocols which will involve channel 
estimation and quantized feedback. However, unlike [10, 11], 
we will not account for resources spent on feedback path 
since the methods developed in [10] directly apply to the 
current case. Instead, we will focus on the more important 
issue of understanding how the mismatch in the transmitter 
and receiver channel information affects system performance. 

To develop a systematic understanding of the two-way 
fading channel described in Section II-A, we will consider 
two forms of receiver knowledge about the channel H. The re- 
ceiver will either be assumed to know the channel H perfectly 
or have a noisy estimate H obtained using a minimum mean- 
squared channel estimate (MMSE) via a training sequence 
(described in detail in Section V). 

For the transmitter knowledge, the receiver will quantize 
its own knowledge (H or H) and map it to an index Jr, 
where Jr G {0, 1, . . . , — 1} where K > 1 is the number of 
feedback levels used by the receiver for feedback. The receiver 
then transmits the quantized channel information Jr over the 
feedback channel and the transmitter knowledge of the index 
is denoted by Jy. For the case when the feedback channel 
is assumed to be noiseless, Jj = Jr, else Jj ^ Jr with 
finite probabihty. Here the error probability will be depend 
on the channel signal-to-noise ratio SNR and the transmission 
scheme. Based on the quantized information about the channel, 
in the form of Jj, the transmitter adapts its codeword to min- 
imize the probability of outage (defined in the next section). 
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Fig. 2. Two-way protocols: In a two-way channel, the protocols can involve 
many exchanges between the transmitter and receiver. The arrows indicate the 
direction of transmission. 
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C. Diversity-Multiplexing Tradeoff 

We will consider the case when then the codeword X 
spans a single fading block. Based on the transmitter channel 
knowledge Jj — i, the transmitted codeword is chosen from 
the codebook = Xi(2), • • • ,X,(2^^-")K where 

the codebook rate is R. All Xi(fc)'s (for 1 < fc < 2^'^"°'^) are 
matrices of size m x Tcoh. We assume that Tcoh is finite and 
does not scale with SNR. In this paper, we will only consider 
single rate transmission where the rate of the codebooks 
does not depend on the feedback index. Furthermore, we 
will assume codebooks Ci are derived from the same base 
codebook C by power scaling of the codewords. In other 
words Ci = \f~PiC, where the product implies that each 
element of every codeword is multiplied by ^JPi where each 
element of codeword X{k) e C has unit power. Thus, Pj is 
the power of the transmitted codewords. Recall that there is an 
average transmit power constraint, such that E(Pi) < SNR. We 
further assume that the base codebook C consists of Gaussian 
entries, in other words we only focus on Gaussian inputs in 
this paper. 

In point-to-point channels, outage is defined as the event 
that the mutual information of the channel for a channel 
knowledge H G ^ with some distribution Phi^ (for ? a 
subset of all n X m matrices) at the receiver, I^{X;Y) is 
less than the desired rate R [21]. If ^ is a singleton set 
containing H, the channel H is known at the receiver in which 
case mX;Y) = \ogdct (^I + HQ is the mutual 
information of a point-to-point link with m transmit and n 
receive antennas, transmit signal to noise ratio P{Jt) and 
input distribution Gaussian with covariance matrix Q [22]. The 
dependence of the index at the transmitter is made expUcit by 
writing the transmit SNR as a function of Jj. Note that Pi 
and P{i) will mean the same thing in this paper. 

Let n(0) denote the probabiUty of outage, where O is the 
set of all the channels where the maximum supportable rate 
mX; Y) is less than the transmitted rate R [21]. The system 
is said to have diversity order of d if 11(0) = SNR""* 
Note that all the index mappings, codebooks, rates, powers are 
dependent on the average signal to noise ratio, SNR. Specif- 
ically, the dependence of rate R on SNR is expUcitly given 
by R = r log SNR, where r is labeled as the multiplexing 
gain. The diversity-multiplexing tradeoff is then described as 
the maximum diversity order d{r) that can be achieved for a 
given multiplexing gain r. 

The following result captures the diversity-multiplexing 
tradeoff for the case of perfect receiver information and no 
transmitter information. For a given rate R = r log SNR and 
power P = SNRP, define the outage set 0{R,P) = {H : 
(/ + j^HQW^ < R}. Denote the result diversity order as 
G{r,p), which is n(C'(i?,P)) = SNR^'^^'''^'. The following 
result completely characterizes G{r, p) and is a straightforward 
extension of the result in [22]. 

Lemma 1 ([16]). The diversity-multiplexing tradeoff for P = 

' We adopt the notation of [22] to denote = to represent exponential equality. 
We similarly use <, >, <, > to denote exponential inequalities. 



SNR^ and r < pmin(m, n) is given by the (r, G{r,p)) where 

min{m,n) 

G{r, p) = inf (2i — 1 + max(m, n) — min(m, n))ai 

i=l 

where the inf is over all a\, - ■■ , amin(m,n) satisfying 

min{m,n) 

* *^miii(m,n) 

i=0 

Remark 1. G{r,p) is a piecewise linear curve connect- 
ing the points {r,G(r,p))= {kp,p{m — k){n — fc)), fc = 
0, 1, ... , min(TO, n) for fixed to, n and p > 0. This follows 
directly from Lemma 2 of [13]. G{r, 1) is the diversity 
multiplexing tradeoff with perfect CSIR and the transmit signal 
to noise ratio = SNR [22]. Further, G{r,p) = pG{^, 1). 

D. Summary of Results 

We will study the following four systems with different 
accuracy of channel state information (CSI) at the transmitter 
and receiver. 

1) CSIRTq! In this case, the receiver knowledge about H is 
assumed to be perfect and the transmitter is assumed to 
receive a noiseless quantized feedback from the receiver 
about channel H. This case was first studied in [23] and 
the diversity order increase was first proved in [17] with 
later extensions in [13]. The channel quantizer maps H 
to an index Jr which is then communicated over the 
noiseless feedback channel. Since the feedback is noise- 
less, Jt = Jr. Based on the index Jj, the transmitter 
adapts its transmission power as described in Section III. 
The diversity order for K levels of feedback is defined 
recursively as dRT^{r,K) = G{r,l -\- dRT^{r,K - 1)) 
where d^j^^ (r, 0) = and grows exponentially in number 
of feedback bits. 

2) CSIRTq: Our first set of results analyze the case of 
imperfect channel knowledge at the transmitter, where 
the errors are caused by errors in the received quan- 
tized feedback index. In this case, Jj ^ Jr with 
finite probability. We show that if a MIMO scheme 
optimized for equally likely input messages is used to 
send the feedback information, then the diversity order 
is limited to d^j. (r, K) given in Table I. However, 
the feedback information is not equally-likely. Hence, 
we propose a natural unequal error protection method 
labeled power controlled feedback, where the power 
control is performed based on input probabilities. The 
power-controlled feedback results in a diversity-order 
d^ (r, K), specified in Table I. With power-controlled 
feedback, the maximum diversity order increases from 
djj=P (0, K) = 2mn to d^j (0, K) = mn{mn -\- 2) for 
all K > 2. For K = 2, the maximum diversity order 
increases from d^^ (0, K) = 2mn to d^ (0, K) = 
mn{mn -\- 1). 

3) CSIRTq! Our next step will be to isolate the effect 
of errors in receiver knowledge of the channel. Thus, 
the receiver will estimate H, which will be mapped 
to Jr. Since the feedback is assumed to be perfect. 
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the transmitter knowledge is same as that of receiver, 
Jt = Jr- We will show that for a three-phase power- 
controlled training based protocol can achieve a diversity 
order of dg^ (r, K) = G{r, 1 + G(r, 1)). In fact, if the 
power-controlled training is not performed then channel 
estimation errors completely dominate and feedback 
is rendered useless; the resultant tradeoff collapses to 
dgj (r, K) = G(r, 1) for all K. So analogous to power- 
controlled feedback, power-controlled training appears 
essential to improve the diversity order in this case. 
4) CSIRTq: Finally, we put the above two cases together 
and derive the maximum achievable diversity order as 
(r, K) = G{r, 1 -|- G{r, 1)), which is equivalent to 
the case with only receiver errors. Thus, we conclude 
that the receiver errors dominate the achievable diversity 
order. 

Remark 2. An important word of caution for all the results 
(new and known) summarized in Table I. Unlike the previous 
work in [10, 21], we do not account for the resources spent in 
channel training and feedback in this paper. Resource account- 
ing can be performed using the procedure developed in [10], 
by scaling the multiplexing r appropriately. More precisely, 
the multiplexing r should be replaced by rTcoh/{Tcoh ^ 1) 
for CSIRTq, rTcoh/(Tcoh ~ 2m) for the three-phase protocol 
in power-controlled training of CSIRTq, rTcoh/(7coh — ™) for 
two-phase protocol in constant power training of CSIRTq, and 
''^coh/Clcoh — 2m — 1) for the three-phase protocol of CSIRTq. 
The resource accounting multipliers assumes that the feedback 
requires one channel use and the training requires m channel 
uses. The details are further explained in Remarks 3 and 5. 
Further, the use of the number of antennas to use and the 
protocols would need to be optimized for each multiplexing as 
in [10,21] and is omitted in this paper for readability. 

III. CSIRTq: Perfect CSIR with Noiseless 
Quantized Feedback 

The diversity-multiplexing tradeoff for the case of receiver 
with perfect information and noiseless quantized information 
has been extensively studied in [8, 13, 14]. In this section, we 
will discuss the main ideas for the single user MIMO channel 
model stated in Section II-A. We start off with an example to 
illustrate the main idea. 

Example 1 (SISO): Consider the case where m = n = 1. 
Without any feedback, the maximum diversity (at r ^ 0) 
is 1. The space of channels {H} = C can be divided into 
two sets: the outage set Oq = : < "^^j and its 

complement Oq = {H}\Oo; see Figure 3(a). The probability 
of the set n(Oo) « SNR"^ and n(Oo) « (1 - SNR"^). 

With one-bit of noiseless feedback, the receiver can convey 
whether the current channel H belongs to Oq or Oq. Since 
the outage event is rare, the transmitter can send much larger 
power than usual to reduce the outage probability as follows. 
When the feedback index is Jr = 1 representing event Oq, the 
transmitter will use transmit power |SNR/n(C)o) ~ |SNR^. 
For the feedback index Jr = 0, representing Oo, power 



iSNR is used. The average power used in the above two- 
level power control is iSNR/n(e>o) • n(Oo) + ^SNR • (1 - 
SNR-^) < SNR. With the above one-bit power control, the 
outage probability is w SNR~^ since the set of channels in 
outage is reduced to d = {iJ : < (?!!^i^^^I(Oo) | „ 

[h : \\Hf < i^J^}; see Figure 3(b). 

If the feedback rate was log2(3) bits/channel state, then 
the receiver could convey information about three events 
{Oi,Oo\Oi, Oq}. In state Oi, the transmitter can send power 
« SNR^ since n(Oi) « SNR"^ and thus reduce the outage 
probability to SNR~^. If we had two bits of feedback or 
equivalently K = 4 levels, then the diversity order will be 
4 and the outage region will be given by O3 which will be 
the set of all those channels which could not support rate 
R with power w SNR^; see Figure 3(c). Using the above 
recursive argument, for K levels of feedback, a diversity order 
of K = 2^ can be achieved, where b is the number of feedback 
bits per channel realization. ■ 

The above example captures the essence of the general result 
for the MIMO channels, given by the following theorem. 

Theorem 1 ([13]). Suppose that K >1 and r < min(m, n). 
Then, the diversity-multiplexing tradeoff for the case of perfect 
CSIR with noiseless quantized feedback is dRT^ (r, K) defined 
recursively as 

dj,T^{r, K) = G{r, 1 + di^r^ir, K - 1)), 

where dgj,i {r, 0) — 0. 

The main idea of the proof is along the lines of Example 1 
and is summarized as follows. Based on its knowledge of 

H, the receiver decides the feedback index Jr = Jj G 
{0, 1, • • • ,K— 1}, where = 2'' is the number of quan- 
tization levels and b is the number of feedback bits. If the 
transmitter receives feedback index Jj, it sends data at power 
level Pj^. Without loss of generality, -Po < -Pi < • • • < Pk-i- 
The optimal deterministic index mapping has the following 
form [13], 

{argminjei i, 1= [k: log det(/ + HWPk) > R, 
k€{0,--- ,K-l}} 
0, if the set I is empty 

Based on the above assignment of feedback indices, the 
optimal power levels can be found out as Pj = SNR^"'"^* where 
Pi are recursively defined as: po = 0, pj = G{r, l+pj_i)Vj > 

I. Using the above recursion, the optimal diversity is given by 
G{r,l -\- pk-i) which reduces to rfRT,(r, i^), thus proving 
Theorem 1. We note that special cases of Theorem 1 were 
also proved in [8]. 

We note that event that {H : \ogdet{I-\-HH'^ Pk-i) < R} 
corresponds to the outage event, because none of the power 
levels can be used to reach a mutual information of R. Thus 
any state can be assigned to the outage event and in fact, the 
diversity-multiplexing tradeoff is unaffected by which index 
is used to represent this state. Since the receiver knowledge 
is perfect, using the index Jr = ensures that the overall 
power consumption is minimized. However, when the receiver 
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TABLE I 

Summary of Diversity-Multiplexing Tradeoffs. See Remark 2 for a caution in using this table. 



Case 


Main Characteristic 


D-M Tradeoff 


CSIRT 


Perfect Information at T and R 


dKi{r) = oo Vr < min(m, n) 


CSIRTq 


Quantized Information at T 


dRT, (r, K) = G{r, 1 + ^rt, (r, if - 1)), dax, (r, 0) = 


CSIRTq 


Noisy information at T 
(Constant Power Feedback) 


d^J^r^K) =mm.{BK{T),mn-\-G{r,l)) 


CSIRTq 


Noisy information at T 
(Power-controlled Feedback) 


min^-i(mn(((?0+ - + dRT,(r, i))) 


CSIRTq 


Noisy information at R 
(Constant Power Training) 


ir, K) = G(r, 1) 


CSIRTq 


Noisy information at R 

(Power-controlled training) 


d^^^{r,K) = G{r,l + G{r,l)) 


CSIRTq 


No Genie-aided information 
(Power-controlled training & feedback) 


d^^{r,K) = G{r,l + Gir,l)) 




Fig. 3. Example 1: Channel events for the case of (a) no feedback (Oq is the set of channels in outage) and (b) one-bit feedback (d is the outage set) and 
(c) two-bit feedback (O3 is the outage set). 



or transmitter knowledge is not perfect (which will be the case 
in the rest of the paper), we will assign the perceived outage 
state the highest power level Pk-i, which helps reduce the 
outage probabiUty due to misestimation of channel. 



The maximum diversity order in Theorem 1 increases very 
rapidly with the number of feedback levels K. The maximum 
diversity order [13] d^j^{0,K) = which grows 

exponentially fast in the number of levels K. In fact, as 
iv" — )• 00, the diversity order also increases unboundedly. Since 
as K ^ 00, the feedback approaches the perfect feedback 
and as a result, perfect channel inversion becomes possible. 
In [3], it was shown that for perfect channel state information 
at transmitter and receiver, zero outage can be obtained at finite 
SNR if max(m,n) > 1. Perfect channel inversion essentially 
converts the fading channel into a vector Gaussian channel, 
whose error probability goes to zero for all rates less than its 
capacity. The unbounded growth of diversity order is rather 
unsettUng and is in fact, a fragile result as shown by our results 
in the following sections. 



IV. CSIRTq: Perfect CSIR with Noisy Quantized 

Feedback 

In this section, we will analyze the case when the receiver 
knows the channel H perfectly but the quantization index Jr 
is conveyed to the transmitter over a noisy feedback link, 
resulting in the event Jj ^ Jr with non-zero probability. 
We will consider two feedback designs. In the first design, 
the feedback channel will use a constant power transmission 
scheme designed for equally-likely symbols (which is the 
commonly studied case for i.i.d. data). Leaming from the 
limitations of constant power feedback transmission, we will 
then construct a new power-controlled feedback strategy which 
will exploit the unequal probabiUty of different channel events 
quantized at the receiver. 

A. Constant Power Feedback Transmission 

First, we observe the impact of using the power control 
described in Section III by considering Example 1. 

Example 2 (Impact of Feedback Errors, SISO): Consider the 
case of 6 = 2 feedback bits which allows K = 4 feedback 
indices. In this case, the receiver can convey four events, which 
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we choose to be O {Oq, Oq \ Ci, d \ C»2, 02} (shown in 
Figure 3(c)), where the events are described as 

0, = {H : log (1 + |i?|2SNRi+') < i?} , i = 0, 1, 2. (3) 

As discussed in the previous section, the power control in 
the forward channel uses four different power levels, Pj^ = 
SNR^+-^"^ for Jt = 0, 1,2,3. The probability of each of the 
above events and the associated power control is defined in 
Table II. The last column in Table II shows the impact of 
probability of events as seen by the transmitter when there 
are errors in the feedback link such that 11 (Jj — j\JR = 
i) = SNR^ for j ^ i as shown in Figure 4(b). For example, 
with noisy feedback, II{Jt = 3) = ELon(^T = 3|Jr = 
j)n( Jr = j) = SNR-^ + SNR-^SNR-'' + SNR'^SNR'^ + 
SNR-^ = SNR-\ 

In fact, there are two dominant error events, first being Oq 
(Jr = 0) being confused as either Oq \ Oi or Oi \ O2 or 
O2 (Jj = 1, 2 or 3) which amounts to limiting the maximum 
power that can be used and the second being the probability 
that Oi (Jr = 1) being confused as Oq {Jj = 0). The second 
error event has approximate probabihty of SN R~^ which limits 
the maximum diversity to 2. Thus, the event probabilities at 
transmitter are no longer the same in the presence of the 
feedback errors. As a result, the transmitter cannot use the 
power control Pj-^ = SNR^"'"'^'^ without exceeding the average 
power constraint. In fact, the highest power the transmitter 
can send is of the order = SNR'^, and the maximum diversity 
order of the constant power feedback mechanism is Umited to 
2 irrespective of the number of feedback bits. ■ 

We now generalize the above example to MIMO for ar- 
bitrary multiplexing gain. Define Bj{r) be defined by the 
recursive equation 

E,(r) = (0' _ ^■ = ^ 

' \G(r,l + min(mn,B,_i(r))), j>l 

Theorem 2. [16] Suppose that K > 1 and r < min(m, n). 
Then, the diversity-multiplexing tradeoff for the constant 
power feedback transmission with K indices of feedback is 
given by 

djff^{r, K) = mm{BKir), mn + G{r, 1)). 

Remark 3. Accounting for the feedback resources can be done 
as follows. In the limit of high SNR, the feedback will consume 
one channel use and hence to get the rate R = rlog(SNR), 
r should be replaced by rTcoh/(7coh — 1) in the above 
expression. However, the reader must note that there are 
implicit time factor terms which can be easily integrated and 
one may carry out an optimization on the diversity obtained 
as a function of these time loss terms for each multiplexing. 
For example, for r > min(m, n) (Tcoh — l)/2coh7 the feedback 
would not be useful due to the time spent and it is better to 
use a non-feedback based strategy. 

Much like in Example 2, the maximum diversity order for 
r — )^ is 2mn for all > 2. That is one-bit of feedback 
is sufficient to achieve the maximum diversity order with 
constant-power feedback. However, for r > 0, as the number 



of feedback levels K increases, the diversity order (r, K) 
also increases, such that {r, K) < G(r, 1) + mn for all r. 
Note that G{r, 1) is the diversity order without any feedback. 
Coincidentally, the diversity order of the feedback link is mn 
(feedback link is non-coherent where the error probability can 
decay no faster than SNR^™"), and ends up determining the 
maximum gain possible beyond G{r, 1). 

The key bottleneck in the above result is that the feedback 
link is using a transmission scheme optimized for equally- 
likely signals, which is appropriate if the feedback link was 
being used to send equally-likely messages like data packets. 
However, the information being conveyed in the feedback 
link is not equiprobable and hence the usual MIMO schemes 
optimized for equally-likely messages are not well suited. 
Again, in the context of Example 2, we will first show an 
alternate design of power-controlled feedback can improve the 
diversity order in the next section. Before we proceed, we note 
that our analysis in [10] for a TDD two-way channel yields the 
same maximum diversity order of 2mn. It is satisfying to see 
how two different feedback methods have identical behavior; 
their relationship is further explored in [24]. 

B. Power-controlled Feedback 

In this section, we exploit the unequal probabiUties of 
the outage events at different power levels to develop a 
power-controlled feedback scheme to reduce overall outage 
probabilities. Our feedback transmission wiU be designed for 
a non-coherent channel, since the feedback channel Hf is not 
known at the transmitter or the receiver. 

Example 3 (Power-controlled Feedback): For the constant- 
power feedback transmission scheme described in the Sec- 
tion IV-A, each input is mapped to a codeword with the same 
power and is pictorially depicted in Figure 5(a). However, 
since the events in set O are not equi-likely, we assign 
different power levels to each event. Now consider the power 
assignments shown in Table HI, also depicted in Figure 5(b) 
for the feedback channel. The average power over the feedback 
channel is 

SNR"(1 - SNR^^) + SNR^SNR-^ -t- SNR^SNR'^ 
+SNR~^SNR'' =^ SNR 

Here the rare events are conveyed with more power, which 
allows a more reliable deUvery of the feedback information 
without violating feedback power constraint. Thus, the power- 
controlled scheme resembles an amplitude shift keying. 

The reason that the probabiUties at the transmitter and the 
receiver are same for power-controlled is because the events 
Jj > Jr occur with exponentially small probability. The out- 
age probabihty can be lower bounded by n( Jy = 0, Jr = 1) 
which is an event when the receiver requested a higher power 
level than what it received. Note that n(jR = 1) = SNR~^ 
and n(JT = 0|Jr = 1) = SNR^"^. Hence, n(JT = 0, Jr = 
1) = SNR"^SNR"^ = SNR"^ is the probability of this 
dominant error event. Note that both the forward and feedback 
channel power constraints are satisfied. Hence, we can obtain 
a diversity order of 3, which is higher than the diversity order 
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TABLE II 

EXAMPLE 2: Probability of events at the transmitter and receiver with and without noise in the feedback channel. Receiver 

KNOWLEDGE IS ASSUMED TO BE PERFECT. (CAUTION: PROBABILITIES ARE ONLY REPORTED UP TO THEIR ORDER, AND CONSTANTS SUCH THAT THEY 

SUM TO ONE ARE OMITTED.) 



Event 


Feedback 
Index (Jr) 


Prob of Event 
at Receiver 


Prob at Transmitter 
(noiseless feedback) 


Prob at Transmitter 
(noisy feedback) 







(1-SNR-^) 


(1-SNR-^) 




Oo\Oi 


1 


SNR-i 


SNR-^ 


SNR-^ 


Ol\02 


2 




SNR-2 


SNR-^ 


02 


3 


SNR"^ 


SNR-3 


SNR-^ 



Event 


Input Prob 






1 -SNR-^ 






SNR-^ 




Oi\02 


SNR-^ 




O2 


SNR-^ 





Output Prob 
1-SNR-^ 

SNR-^ 

SNR-^ 

SNR-^ 




Output Prob 
P 1-SNR-^ 

SNR-^ 

/SNR-^^\ 

I I 

b \ SNR-^ ;' 



(a) 



(b) 



Fig. 4. Example 2: Input-output probabilities of the feedback channel, when (a) the channel is noiseless and (b) when the channel is noisy causing mismatch 
between transmitter and receiver knowledge, assuming that a code optimized for equaUy-hkely source is used. The noise in the feedback chaimel changes the 
output probabilities, as shown by the circled output probabilities. (Note that the sum of probabihties is more than one since we have omitted the constants 
for ease of understanding). 

TABLE m 

Example 3: Power assignment for noisy feedback channel. (Caution: Probabilities are only reported up to their order, and 

constants such that they sum to one are omitted.) 



Event 


Feedback 
Index 


Prob at Receiver 


Feedback Transmit 
Power 


Prob at Transmitter 
(noisy feedback) 


Oo 





(1-SNR-^) 




(1-SNR-^) 




1 


SNR^i 


snr2 


sm-^ 


Ol\02 


2 


SNR"2 


SNR^ 


SNR~2 


02 


3 


SNR-^ 


SNR^ 


SNR-^ 



of 2 obtained via constant-power feedback design (Example 2) 
but lower than 4 that can be achieved with a noiseless feedback 
channel (Example 1). ■ 




Fig. 5. Examples 2 and 3: Dominant error events when (a) the feedback 
channel uses a transmission suited for equally-likely messages and (b) a 
power-controlled feedback design where codewords are power-controlled 
based on their probabiUty of occurrence. 

We will generahze the above example to the case of general 
MIMO channels by using the following power-control for the 
feedback channel. Let the feedback power level for different 



feedback messages be Qo = SNR*, Qi = SNR«\ . . ., 
Qk-1 = SNR'^"^ with Qj > Qj-i. Assume for now that 
the set of powers {Qi} satisfy the feedback power constraint 
of SNR. 

We will use the maximum aposteriori probability (MAP) 
detection rule to detect the message transmitted by the re- 
ceiver on the feedback channel. If the transmitted signal is 
at signal to noise ratio level of Q, the received power is 
Sf = tracciHH^Q + WW'^ + ^/Q{HW^ + WH^)). Since 
we are encoding the feedback index information in signal 
power, we will compare the received power 5/ with different 
threshold power levels. We first note that for all qi < 0, 
the received power will be dominated by the noise term 
trace(VFVl^t)^ which is = 1 with high probability. Thus, we 
assume that qj > for j > 0. 

Suppose the eigenvalues of HW are (Ai,--- ,Am„) and 
Xi = SNR~"' where a, are the negative SNR exponents of 
the Ai, niN = min(m, n) and a = (ai, ■ ■ ■ , amw)- Then, the 
distribution of ai is given by the following result. 

Lemma 2. [22] Assume ai > a2 > ■ ■■ ^''^ power 
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exponents as described above. In the limit of high SNR, the 
probability density function of the SNR exponents, ol, of the 
eigenvalues of HH^ is given by 



min(a)>0 • 



(4) 



We now show that the thresholds for MAP decoding are 
SNRmax(qo,o)+e^ SNR9i+% .3NR«^-^+' for some small e > 
0. This can be derived by observing reason is that Ii{Jj > 
i|Jr = i) decays faster than polynomially^ in SNR and is 
hence = as long as the threshold for detecting Jj = i is 
above SNR«'+' for any e > 0. To see this, for < j < iiT- 1, 

n(JT>j|JR = j) 

= n (trace(£rJ?^SNR'"'^"(«-°)) + 1 > SNR'"""(«-°)+^) 

= n (^(SNR9^-"') + 1 > SNR"'^^fe.o)+^^ 

= n ^SNR™^^'^~™'""*'°^ > SNR™'*'^^*^'"^"'"^^ 

= n (max(gj — min aj , 0) > max(g'j , 0) + e) (5) 

We see that for the last expression to occur with finite prob- 
ability, minaj < which cannot happen with polynomially 
decreasing probability by Lemma 2 and hence 11 (Jy > jVr = 
j) = irrespective of e > 0. Now, for MAP detection, we 
would find a threshold between qi and qi+i so as to minimize 
n(JT = i + l, Jr = z)+n(JT = z, Jr = i + l). Since the first 
term = and the second term is minimized by choosing e as 
small as possible, choosing e small enough gives the desired 
threshold for MAP decoding. 

Further, for all Q>SNR, tTa.ce{HH'iQ + WW^ + 
y/Q{HW^ + WH^)) = ir&ce{HW)Q + 1. Hence, we 
obtain IV{Jt = 0|Jr = 1) = Ii{tT:&ce{HH^SHW) + 
1<SNR'"^^(''"'°)+'). Let \i be the eigenvalues of iJiJt and 
Ai = SNR""\ Then, 



n(jT = o|jR = i) 

= n (^trace(ifi?1^SNR«i) + 1<SNR" 



:(9o,0)+' 



(min(m,n) 
SNR91-"* + I < SNRmax(go,0)+e 

= n ^SNR™'"^'^~™™"*''^'<SNR™'"^'°''^^~''^^ 
= n (max(gi — min a,, 0) < max(go, 0) + e) 
= n (minai > qi — max((jfO) 0) — e) 

where the last step follows from Lemma 2. Since 
gi < 1 + G(r,l) (as n(jR = l)>SNR-^('''^^ and 
there is power constraint for feedback link of SNR), 
the outage probability is lower bounded by n(JT = 
0,Jr = 1) > SNR-^('"'i)SNR-™"(i+^('''^)-°) = 



SNR 



-mn(l+G(r,l))-G(r,l) 



. Further, we can use the same tech- 



nique to find that for any j < i, 

n(JT = i|JR = i) = sNR-™"(«^-'"''''(«^'°». (6) 

^Any function /(SNR) which decays faster than any polynomial in SNR, 
like exponential or super-exponential functions of SNR, are = since 

limsNR-t-cx. log(SNR) - 



Thus we obtain the following result. 

Theorem 3 (Power-controlled Feedback). Suppose that K > 

1 and r < min(m, n). Then, the following diversity- 
multiplexing tradeoff can be achieved with power-controlled 
feedback 



djff (r, K) = min dRj^ (r, K), max 

' y 93<l+*7-<,(r,j) 

min (mn((g,)+ - +dR7-,(r,i)) 
Proof: The proof is provided in Appendix B. 



Corollary 1. The diversity multiplexing tradeoff with one bit 
of imperfect feedback is same as the diversity multiplexing 
tradeoff with one bit of perfect feedback. In other words, for 
K = 2, d^f^{r, 2) = G{r, 1 + G{r, 1)) = rf«j-,(r, 2). 

Proof: For K = 2, the optimal choice of q is to 
maximize gj*" — q^ which gives optimal choice of go = and 
qi = 1 + G{r, 1). Using this for one bit of feedback, diversity 
of G(r, 1 + G(r, 1)) can be achieved with power-controlled 
feedback which is the optimal considering the upper bound of 
perfect feedback is dsj^^r, 2) = G(r, 1 G{r, 1)) . ■ 

Thus, we find that there is no loss of diversity with one 
imperfect bit of feedback compared to the case of perfect 
feedback. Hence, power controlled feedback scheme is better 
than the constant power scheme which was limited to a 
maximum of 2mn diversity even as r — )• 0. 

Corollary 2. As K ^ oo and r ^ 0, the maximum 
diversity that can be obtained with power-controlled feedback 
is mn{mn -\- 2), i.e. 



lim 

X— ^oo,r- 



djff {r, K) = mn{mn + 2). 



Proof: Note that in this case, choosing q^ — and qi = 
1 + dRT,(r, «) w 1 + mn '^^Zi 'Lmn>i + «lmn=i givcs the 
optimal diversity multiplexing point for r ^ and thus, the 
diversity is mn{mn-\-2). We further note that K = Sis enough 
to get this point. ■ 

Thus, even with power-controlled feedback, arbitrary num- 
ber of feedback bits do not yield unbounded increase in 
diversity order as in the case of CSIRTq, where the diversity 
order increases unbounded with K. 

Note that however, we restricted our attention to an ordering 
1j ^ 9^-1 which can be relaxed giving better results as in the 
following Lemma. However, for our objective of the achiev- 
ability for imperfect channel state at the receiver and imperfect 
channel state at the transmitter, the achievabiUty strategy in 
Theorem 3 is enough. Further note that the new achievability 
strategy which relaxes the assumption of qj > qj-i does give 
improved diversity, but still the diversity remain bounded with 
increase of the feedback levels. 

Lemma 3. Let qi > and q^ ^ q^ for any i ^ j, < 
i,j < K — 1. Then, the following diversity can be achieved 
by power-controlled feedback. 
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min I dRT^ (r, K), max 

V (90,--- ,9Jf-l)6Q 

• -.n ^ T-'i- ^ ^ {mn{qi - Qj) + dRT^{r, i)) ) , 

iyjEO^...^K—l:j<i and qjKQi J 

where Q = {(go, • • • , g^-i) : qj < 1 + 
mm{dRT^{r,j),mini^[o^...j_^,g.^g.{dRT^{r,i) + {qi - 
qj)mn))yO<j < K - 1}. 

Proof: The proof is a simple generalization of the proof 
of Theorem 3. The constraint on Q keeps the 11 (Jj = 
«)SNR^'<SNR for the power constraint and the diversity 
expression records the possible dominant outage events. ■ 

C. A Source Coding Interpretation 

The idea of power-controlled feedback transmission is akin 
to source-coding the feedback information. In conventional 
lossless source coding (e.g. Huffman coding), the currency of 
representation is bits. There, to minimize average code-length, 
the codeword length is (approximately) inversely proportional 
to the probability of an event. Rare events are represented by 
longer codewords while more frequent events are represented 
by shorter length codewords, thereby minimizing the average 
codelength. 

Analogously, our currency is average transmit power and 
objective is to minimize average error probabiUty. Thus rare 
events get higher power and frequent events lower transmit 
power. Note that there are many power allocations which will 
meet the power constraint but they will all result in different 
error probabilities. Our proposed feedback transmit power al- 
location minimizes the error probability (in asymptotic sense). 

V. CSIRTq: Estimated CSIR with Noiseless 
Quantized Feedback 

In this section, we will consider the case when the receiver 
obtains its channel information from an MMSE estimate, H, 
using training. As a result, the receiver index Jr is not always 
equal to the optimal index, J, based on actual channel H. We 
will model the relationship between J and Jr via an effective 
channel (described in Section V-C). The feedback link, on 
the other hand, is assumed to be noiseless. As a result, the 
transmitter index Jj = Jr. 

We will analyze two protocols. The first protocol, labeled 
constant-power training, trains the receiver once at the begin- 
ning using a constant power level. We show that feedback, 
even if noiseless, is completely useless in providing any 
gains in diversity order compared to a no-feedback system. 
Inspired by our understanding of the noisy feedback channel 
in Section IV, we propose a second protocol, labeled power- 
controlled training which utilizes the feedback and trains the 
receiver twice, where the second ttaining is power-controlled 
based on feedback information. The second protocol has a 
higher diversity order than any non-feedback system. 

A. Training the receiver 

In this subsection, we wiU consider MMSE channel estima- 
tion for a single user MIMO channel. The channel is estimated 



using a training signal that is known at the receiver. From the 
received signal, MMSE estimation is done as in [25] to get an 
estimate H of the original channel H. Let Xt be the training 
signal of size m x A'^ for some m < N < Tcoh that is known 
at the receiver and transmitter. The transmitter sends Xt and 
the destination receives Yt = HXt + Wt where Yt is a 
n X N received signal and Wt is the additive Gaussian noise 
with each entry from CN{0, 1). Following [25], the optimal 
training signal is 



Xi 



npo 



1 +J 



m Omx{N—m) \ > 



where /io is tuned to satisfy power constraint, denote mxm 
identity matrix and 0,nx{N-m) represents mx {N~m) matrix 
having all entries 0. Hence, n/uo — 1 = ■ Further, the 

channel estimate is given by 



H = Yt{xIXt + In)-^xI = Yt 



^N—mxm 



which can be rewritten as 



H = H- 



1 



W2 



JVSNR 



AfSNR 



1 



JVSNR 



(7) 



where W2 is the left n x m submatrix of Wt- We now note 
some properties of MMSE estimate. First, it is easy to see 
that expected value of (H — H)H is zero confirming the 
orthogonality of the error with the unbiased estimate. Further, 



the variance of any entry in {H — H) is 



1+^ 



Thus, H 



NSNR 

for analyzing 



and H are matrices in which each corresponding element 
is highly correlated with the correlation coefficient between 
corresponding element of H and H is p= ,^ _ . Further 

note that any N >m channel uses are equivalent 
the asymptotic performance. 

In general, if G and H are correlated with correlation 
coefficient p, the joint probability distribution function of 
eigenvalues of HW and GG^ is given by the following result: 

Lemma 4. [26] Consider two n x m random matrices 

H = (hij) and G — ifjij), i £ [!•"]• J ^ each with 

i.i.d complex zero-mean unit-variance Gaussian entries, i.e., 
E\hij\ = E[gij\ = Oyi,j,E[hijhlg] = ^^[ffii^J = ^ip^jg' 
where the Kronecker symbol 5ij is 1 or when i = j or i ^ j 
respectively. Moreover, the correlation among the two random 
matrices is given by E[hijg^p^ = p5ip5jq,\li,j,p,q, where 
p = \p\e^^ is a complex number with \p\ < 1. Let n < m and 
V = m — n. The joint probability distribution function of the 
unordered eigenvalues of HH^ and GG^ is 



exp 



(■ 



l-|p|2 



A(A)A(A) 



iU^^^.i^fxiXr det 



2\p\\/XkXi 



(8) 



where A(.) represents n— dimensional Vandermonde determi- 
nant, Ik{-) denotes the k*^ order modified Bessel function of 
the first kind, the eigenvalues of HW and GG^ are given by 
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A = (Ai, • • • , A„) and A = (Ai, • • • , A„) respectively. 

Note that although Lemma 4 assumed n < m, it can be 
extended to the other case of m > n since nonzero eigenvalues 
of and are the same. Hence for all n and m, let 
niN = min(m, n) and ly = \m—n\. Then, the joint probability 
density function of the unordered eigenvalues of HW and 
GGt is 



SNR" 



'lo<Q=s<i + SNR"^ " "1 



p{X,X) = 



exp 



1-IpP 



A(A)A(A) 



m^!m^!n"-oii!(j 



.(9) 



Recall that the eigenvalues of HW be (Ai, -- ,XmN)' 
Aj = SNR~"* and a = (ai, - ^amjv)- Similarly, let the 
eigenvalues of HH^ be (Ai, - - ,Am„), Aj = SNR~"' and 



a 



(Si, 



, Smjv)- The distribution of a^'s is given earlier 



in Lemma 2. We will now find the joint distribution of aj's 



and Si's. Let ai > a2 > ■ ■■ 
Further, we define 



and Si > S2 > • • • a„ 



1, 



,k, and 

(10) 



Ek = {{ex, a) : mm{ai,ai) > 1 Vz 
< ai = Sj < 1 Vi > fc} 

for all < A: < TTiAT. 

Theorem 4. Let H be the channel and H be the estimated 
channel. In the limit of high SNR, the probability density 
function of the SNR exponents of the eigenvalues of HH^ 
and HW is given by 



k=0 



(11) 



where ai > a2 > ■ ■ ■ ctmn, Si > S2 > 



• an 



J, and 



Cfe = SNR''^l"~™l+^'n^_jSNR~^^*~"^"'"'"~™'^"* 

^j-jmjv5|\jp-(2i-l+|n-m|)a;i^ (12) 

Proof: Since all the results were symmetric about inter- 
changing m and n, without loss of generality, we take m < n 
for the purpose of this proof and the rest of the paper. The 
proof is provided in Appendix A. ■ 

Remark 4. Since the receiver is trained at SNR, we note that 
for all OLi < 1, cci = Sj with probability 1, which means 
that the channel estimate is a reliable proxy for the actual 
channel. On the other hand, if Ui > 1, all we can state is 
that Si > 1 with probability 1. For example, if cti > 100, all 
we can reliably say about ai is that it is > 1. In SISO case, 
the above property of cti implies that the channel cannot be 
resolved below the noise floor since the noise dominates the 
training signal. The interesting implication in MIMO is that 
this result of noise dominance holds for all the eigen-values. 
None of the eigen-value of the channel can be resolved beyond 
Oii>lifai> 1. 

Example 4 (Asymptotic Distribution): For the case of 
m = n = 1, which is our running example, p{a,a) = 



min(a,3)> 1 ■ 



This density 



function will be used to analyze the diversity tradeoffs in this 
section. 



B. Constant-power Training 

We first note the decoding scheme with trained channel 
estimate at the receiver. If the receiver has estimate H trained 
with the power of SNR^ for some p > 0, the estimation error 
variance is SNR-^. As Y = HX + W = HX + {H-H)X + 
W, a lower bound on mutual information can be considered 
assuming that {H — H)X + W is Gaussian noise and hence 
the expression for the coherent channel with actual channel as 
H can be used as a lower bound [21]. 

The protocol is divided into three phases as described below: 
Phase 1: Training from Tx to Rx: The training is done using 
transmit power of SNR to obtain the channel estimate H. On 
the basis of this training, the receiver decides a feedback level 
Jr in the following way. Suppose that there are K power 
levels Pi = SNR^+P% i e{Q,--- ,K- 1}. The feedback index 
chosen at the receiver is given by 

'argminieii, l = {k: log det{I + H Pk) > R, 

kG{0,--- ,K-1}} 
K — 1, if the set I is empty 

Phase 2: The feedback index Jr e {0, • • • ,K — 1} is sent to 
the transmitter via the noiseless feedback channel. Thus, the 
feedback index received by the transmitter is Jj = Jr. 
Phase 3: The transmitter receives a feedback power level Jj 
and sends data at power level Pj^. = SNR^+^-'t. 
Example 5 (Constant-power training): The only error in chan- 
nel knowledge appears in the first phase, where the receiver 
is trained. 

Note that for MMSE estimation H and iJ = iJ - iJ are 
uncorrelated and since the receiver is trained with power of 
SNR, Jhe variance of if is = 1/SNR. Let = SNR"" 
and |Fp = SNR"^SNR"" where the probabiHty distribution 
function of a and 5 is p{a,a) = SNR~"~"lg>ol3>o- The 
probability of outage is 

K-l 

n{0) = ^n(o,jT = i) 

> n(o,jT = i) 

= n(o,jR = i) 

n(C',log(l + l^pSNR^) > rlogSNR, 



> 




iJpSNR) < rlogSNR) 



SNR^ 



l-hSNR2|Jf|2^ 
log(l + l^pSNR^) > rlogSNR, 
log(l I^I^SNR) < rlogSNR) 



< rlogSNR, 



n((2- a- (1 - 5)+)+ <r,(2 

(l-a)+ < r) 
SNR~ ™™(a,s)eA(a+5) 



S)"*" > r, 



(13) 
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)+< 



where A = {{a,a) : S > 0, a > 0, (2 — a — (1 
r, (2 - S)+ > r, (1 - 3)+ < r}. 

Hence, substituting 5 = and a = l — r + 6 for 6 small 
enough (Note that this choice of (a, a) is in A.) gives a bound 
on the above probability as: 



IL{0) 



> SNR 



-(l-r) 



(14) 



Thus, we see that the diversity of this scheme is at most same 
as the one without feedback. Hence, there is no advantage of 
feedback. ■ 

Theorem 5. Suppose that K > 1 and r < min(m, n). Then, 
the diversity-multiplexing tradeoff is given by d-^j. (r, K) = 
G(r,l). 

Proof: We will consider the third phase in this case, 
where we see that any increase in power levels do not 
help to increase the diversity. Achievability follows by 
not using any feedback. We will prove the_converse here. 
Note that for MMSE estimation H and H = H - H 
are uncorrelated and since the receiver is trained with 
power of SNR, the variance of H is = 1/SNR. Let 
A, be the eigenvalues of HH'^ and \i = SNR~"*. 
Further, assume that be the eigenvalues of HH^ and 
Xi = SNR"^SNR""'. The probability distribution function 
of a = (Si, • • • , am) and a = (5i, • • • , ctm) is 5) = 



min(ct)>0-'-miii(a!)>0* 



jjniNg|yjp-(2i-l + |n-m|)(ai+ai)-j^ 

probability of outage is 

K-l 

u{0) = 5^n(0,jT = i) 

> n(o,jT = i) 
= n(o,jR = i) 

> n(0,logdet(7 + ^^tPi) > r log SNR, 

logdet(7 + F^tPo) < r log SNR) 

^ nflogdetf/+ ] 

< r log SNR, 

logdct(/ + ^^1'Pi) >rlogSNR, 
\ogdct{I + HH^Po) < r log SNR 



The 



= n 1^X^(1 +Pi-^i- (Pi - min a)+)+ < r, 

m 

^(1+Pi-S,)+ >r, 

i=l 

m \ 
= S\'ifl-'^'^'^(ei,a)sA'E1Li{{'^i-'^ + \n-m\){ai+Si)) ^^^^ 

where A = {{a, a) : minS > 0, a > 0, X]t=i(l +P1 
{pi - mina)+)+ < r, X^Ilill + J^i J "»)+ > r,J2T=ii^ + 
Po — Si)"*" < r}. Hence, substituting aj = and allocating a 
as in [13] so that ^(l + Po — S,)+ = r — ^ for ^ small 



enough gives a bound on the above probability as: 

n(0) > SNR^ <i)e^ 5Z£i((2i-l+|n-m|)(Si+5i)) 



> SNR"^('''^+f°^ 

> SNR-^('^'i). 



(16) 



In the last step, po < else the transmit power constraint 
cannot be satisfied. Thus, we see that the diversity of this 
scheme is at most same as the one without feedback. Hence, 
there is no advantage of feedback for constant power training. 
This was observed for r — > in |27]. ■ 

Remark 5. Analogous to Remark 3, the accounting for the 
training resources can be done. In the limit of high SNR, the 
training will consume m channel uses to train m. antennas 
(as also seen in Section V-A) and hence to get the rate R == 
r log(SNR), r should be replaced by rT^oh/ (^coh ~ the 
above expression. However, these terms will be omitted in the 
sequel and can be similarly integrated. Much like in Remark 
3, one may carry out an optimization on the diversity obtained 
as a function of these time loss terms for each multiplexing to 
optimize over the number of antennas that need to be trained 
like in [10,21]. 

C. Power-controlled Training 

As we showed in the previous section, the receiver estimate 
was not good enough to help improve the diversity with feed- 
back. We now propose a power-controlled training protocol 
which can improve the outage performance using feedback. 
The protocol is again divided into three phases as described 
below: 

Phase 1: The transmitter sends the training signal using 
power SNR which is used at the receiver to obtain channel 
estimate H. On the basis of this training, the receiver decides a 
feedback level Jr in the following way. Suppose that there are 
K power levels Pi = SNR^+^'* , i e {0, • • • , ii: - 1}. (We will 
give the exact constants in front while proving achievabihty.) 
The feedback index is 



Jr = < 



argmm^giz, 



l^{k: logdet(/ + HH^Pk) > 

i?-Felog(SNR), 

ke{0,--- ,K-l}} 
if the set I is empty 



where e > is some small constant chosen. We will later 
substitute e — > 0. 

Phase 2: A feedback level Jr e {0, • • • ,K - 1} is fed back 
over the noiseless feedback channel to the transmitter, which 
implies Jy = Jr. 

Phase 3: If the feedback index Jy is received, the transmitter 
trains the receiver again at power level Pj^ which is followed 
by the data at power level ^j^. The trained channel estimate 
is denoted by H2 and let H = H — H2. 

The outage probability is defined as: n(C) = 



n ( logdet [1 + 



-t 



P(Jt) H2QH:, 
m i + ^trace(iJiJt) 



:)<-) 

(17) 
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TABLE IV 

Example 6: Power assignment for CSIRTq. (Caution: 
Probabilities are only reported up to their order, and 
constants such that they sum to one are omitted.) 



Event 


Prob at Receiver 


Training and 
Transmit Power 


do 
do\Oi 

02 


1 - SNR-^ 
SNR-^ 
SNR-^ 


SNR^ 
SNR^ 

snr3 

SNR"^ 



This is the effective outage probability using Gaussian code- 
books and considering the channel estimation error as noise 
[10]. 

Example 6 (Power-controlled Training): Since the receiver 
now does not know the value of channel estimate H, but only 
an estimate H, the events O, can in this case be defined as 



iJ : log ( 1 + |i?|^SNR'+* ) < Rj 0,1,2. (18) 



Figure 6 depicts the relation between J and Jr. Let a be the 
negative SNR exponent of while a be the negative SNR 
exponent of \H\'^. Let 



J 



argmiHieii, 



0, 



I={k: log(l + \H\^Pk) > R, 

ke{0,--- ,3}} 
if the set I is empty 



We assume that 3'"'^ phase is perfect for this example and 
hence H = 0. We will prove later that this interference error 
in (17) due to H does not make a difference asymptotically. 

Consider the event ( Jr < 2, J = 2) which would result in 
outage. The powers and the probabilities can be seen in Table 
IV. The outage probability for r — ^ is then 

U{0) > n( Jr < 2, J = 2) 

= n(iogdet(/ + ijij^Pi) > 
iog(i + liJpPi) < -R, 

log(l + \HfP2) > R) 
> n(2-S>0,2-a<0,3-Q!>0) 

= S|\|R™'^'<^2<c,<3,i<3<2(l-Q:-a) 

= SNRi-2-1 = SNR-=^ (19) 

Thus, the maximum diversity order is 2 with any number of 
feedback levels. Note that this is more than constant power 
training which was limited to 1. We will show later in this 
Section that this can be achieved with a single feedback bit. 
■ 

The above example can be generaUzed to MIMO systems 
as follows. 

Theorem 6. For K > I and r < min(m, n), the diversity- 
multiplexing tradeoff of d-^j (r, K) = G{r, 1 + G(r, 1)) can be 
achieved with power-controlled training. Further, the above is 
optimal for zero multiplexing. 

Proof: The proof of this Theorem is provided in Appendix 



C. We will first show that the diversity cannot be greater than 
mn{mn + 1) and later prove that the diversity multiplexing 
tradeoff of G{r, 1 + G{r, 1)) can be achieved for K = 2. ■ 
Note that dg^ (r, K) = dgj (r, 2) which means that diver- 
sity with imperfect receiver information with any number of 
feedback levels K > 2 is same as the diversity with perfect 
receiver information with 1 bit of feedback. 



VI. CSIRTq: Estimated CSIR with Noisy Quantized 
Feedback 

We observed in Section V that with CSIR obtained 
by MMSE training and perfect feedback, the diversity- 
multiplexing tradeoff of dg^ {r,K) = G(r, 1 + G(r, 1)) can 
be achieved with 1 bit of noiseless feedback. We also ob- 
served that the diversity-multiplexing tradeoff of d^ (r, K) = 
G{r, H-G(r, 1)) is also achievable with 1 bit of noisy feedback 
with perfect CSIR. In this section, we will show that diversity- 
multiplexing tradeoff dg^ (r. 2) can be achieved when both 
imperfections are present simultaneously: 1 bit of imperfect 
feedback based on noisy training-based receiver information. 
Thus, our main result is 

Theorem 7. For K = 2 and r < min(m, n), an achiev- 
able diversity-multiplexing tradeoff is given by d^ (r, 2) = 
G(r,l + G(r,l)). 

A. Protocol 

The complete protocol with both power-controlled training 
and power-controlled feedback constitutes of three phases as 
described below. 

Phase 1: The training is done using power SNR to get the 
channel estimate H. On the basis of this training, the receiver 
decides a feedback level Jr in the following way. Since the 
feedback is assumed to be only one bit, there are only two 
power levels at the transmitter. Denote the two power levels 
as Pi = Sm^+P%i G {0, 1} with po = and = G(r, 1). 
(We will give the exact constants in front while proving that 
the average power constraint will be satisfied.) The feedback 
index is 

'1 if logdet(/-F^^tPo^ <i? + eiog(SNR) 
otherwise 



Jr 



where e > is an arbitrarily small constant chosen as before. 

The above index assignment is simply choosing the higher 
power level if the lower power level is estimated to be too 
low. 

Phase 2: A feedback level Jr G {0, 1} is transmitted from the 
receiver which is received at the transmitter as Jj e {0, 1}. 
The receiver employs the power-controlled encoding scheme 
described in Section IV to send the feedback index. The power 
levels used for sending the feedback information are denoted 
as Qo = and Qi = SNR^+°('''^\ 

Phase 3: The transmitter gets a feedback index Jj which is 
used to train the receiver at power level Pjj and then send data 
at power level Pj^. The trained channel estimate is denoted 
H2 and let H = H - H2. 



14 



Noiseless 



Event 


Index ,/ 






n 
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Index J 
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Receiver 
Index Jr 
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(a) 



(b) 



Fig. 6. Example 6: Obtaining channel information at the receiver: (a) if the receiver has perfect information, then it knows the correct feedback index, else 
(b) the receiver index is not knovm perfectly and can be viewed as an output of a noisy charmel. 



The outage probability of the above three-phase protocol is 
upper bounded by 



n(C») = n(logdet(^/+- 
H2QHI 



Jt 



< R 



(20) 



1 + ^^trace(i?i?t) 

The above expression for the effective outage probability 
considers the channel estimation error as noise [10] and hence 
is only an upper bound for the optimal scheme. Following 
identical steps as in Section V, we find that the interference 
error due to H does not impact the analysis asymptotically 
and can thus be ignored. Hence, 

n(0) = n (log det (/ + P{Jt)H2HI^ < i?) . (21) 

Now define J as 

1 if log det (j + H2HIPo^ < ii and 

log det (7 + -&2^2^i) > R 
otherwise 



J= < 



Using the analysis as in Sections 111 and IV, we observe that 
n(JT = G| Jr = 1) = SNR-'""(i+'^('^'i», 

n(jT = i|jR = o) = 0, 
n( Jr = 0, j = 1) = 0. 

We now spht n(0) in (21) into 8 terms depending on the 
values of J, Jr and Jy as foUows, 

n(0) = U{logdet{I + P{Jt)H2HI) < R) 
111 

= E E E n(log det(/ + P{Jt)H2HI) < R, 

i=0 j=0 k=0 

J = t,J^=j,Jj = k). (22) 

Note that the terms with Jr ^ Jj can be asymptotically upper 
bounded SNR~°('~'^+'^^'~'^^\ Also, the term corresponding to 
i = j = k = can be upper bounded by SNR"'^('''^+^('''^)) 
smce when J = 0, logdet(/ + ^(0)^2-^2) < R happens 
when log det(/ + Px^hI) < R. Further, Jr = and J = 1 
happens with probability = 0. Thus, the only remaining case 



is when Jy = Jr = 1. Hence, 

111 

n((^) = EEEnaogdet(/ + P(JT)^2^|)<i?, 

i=0 j=0 fc=0 

J = «, Jr = j, Jt = k) 

+n(l0gdct(/ + P(JT)^2^|) < R, 
Jt = 1,Jr = 1) 
< SNR"'^('''^+'^('''^'' 

+n(logdet(7 + P1H2HI) < R 
= SNR-°('-'i+«('-'i». (23) 

Hence, the diversity order of G{r, 1+G(r, 1)) for multiplexing 
gain r can be achieved with only one bit of feedback. We 
now show that the power constraint is also satisfied which 
completes the proof. Recall that the power levels at the 
transmitter are denoted by Pq and Pi, while at the receiver 
are denoted by Qo and Qi. 

Let Po = ^ and Pi = jp^^xyy. The average power 
used at the transmitter is Pon( Jj = 0) + Pin(JT = 1). To 
show that this average power < SNR, it is enough to prove 
that 

n(JT - 1) < 2(n(jR = 1)). (24) 

Thus, the left hand side 

n(jT = i) - n(jT-i,jR = o) + n(jT = i,jR = i) 
< n(jT = i|jR = o) + n(jR = i). (25) 

Note that the first decays faster with SNR than n(jR = 1), 
this the above < 27r(jR = 1). 

Further, let Qo = and Qi — 2(nfjR=i)) • average 
power used at the receiver is Qon(jR = 0) + QiIL{Jr = 
1) < SNR. 

Thus, diversity order of G{r, 1 + G{r, 1)) can be achieved 
with imperfect feedback and imperfect CSIR. Hence, one bit 
of imperfect feedback and imperfect CSIR is same as one bit 
of perfect feedback and perfect CSIR except the time losses 
in trainings and the feedback. 
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VII. Numerical Results 

First consider the case of ni — 1 and n — 2. The 
diversity multiplexing tradeoff in the various cases for 1 or 
2 bits of feedback can be seen in Figure 7. We will now go 
through all the different tradeoff curves in the order of the 
legend from top to bottom. The first line G{r, 1) represents 
the diversity obtained with no feedback (CSIR and CSIR 
have identical performance since time lost in training is not 
accounted in our expressions) and also the diversity obtained 
in the case when the feedback is perfect and the receiver 
is trained through a constant power symbol (CSIRTq with 
constant power training). The second piecewise linear curve 
represents the diversity obtained when the receiver knows 
perfect channel state information but one bit of constant power 
feedback is sent over a noisy feedback channel (CSIRTq with 
constant power feedback). The third line G{r, 1 + G(r, 1)) 
represents the diversity obtained with 1 bit of perfect feed- 
back (CSIRTq). This also represents the diversity obtained 
when receiver knows the channel perfectly while 1 bit of 
power-controlled feedback is provided on a feedback channel 
(CSIRTq with power-controlled feedback). This diversity is 
also obtained when the feedback link is perfect while the 
receiver is trained using power-controlled training symbols 
(CSIRTq with power-controlled training). Further, the diversity 
obtained when the receiver does not know channel state infor- 
mation and the feedback link is noisy is also G{r, 1 + G(r, 1)) 
(CSIRTq with power-controlled training and power-controlled 
feedback). Note that the second curve started from diversity 
of 4 at zero multiplexing, but then followed the third line 
after r w 0.5 since it cannot perform better than the perfect 
feedback case. 

Next, we consider three levels of feedback. More bits of 
feedback do not increase the diversity in any case except 
when the receiver knows the channel perfectly. In this case, 
the next two curves lines show the effect of constant power 
feedback and the power controlled feedback on the diversity. 
The last line shows the performance with 3 levels of perfect 
feedback. The diversity multiplexing tradeoff curve achieved 
with constant power starts from 4 and hits the line of 3 levels 
of perfect feedback. 

We will now see the effect of increase on SNR on the outage 
probabilities at a constant multiplexing gain. We will focus on 
1 bit of feedback. In Figure 8, rn = n = 1 and r = 0.2. Thus, 
the theoretical diversity for the case of no feedback is 0.8 
while for all other cases considered in the Figure is 1.6 and 
we find that we obtain close to the expected diversity order at 
SNR of about 20 dB. Note that the differences in the higher 
diversity order curves is small and they seem on top of each 
other in the plot. In Figure 9, m = 1, n = 2 and r = 0.5. In 
this case, the theoretical diversity for the case of no feedback 
is 1 while with feedback is 3 which can be noted from the 
slope. 

VIII. Extension to Multiuser MIMO 

In this section, we extend our main result for point-to-point 
systems, i.e. Theorem 7, to the case of multiple access channel 
under the model of common feedback to all transmitters. 



G(r,1) 

1 bit of constant power feedback withi perfect CSIR 

G(r,1+G{r,1)) 

3 levels of constant power feedback with perfect CSIR 

3 levels of power-controlled feedback with perfect CSIR 

- - 3 level perfect feedback with perfect CSIR 




Multiplexing 



Fig. 7. Diversity multiplexing tradeoff in various scenarios. 
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Fig. 8. Diversity multiplexing tradeoff in various scenarios. The various 
parameters chosen ai'e: m=l, n=l, r=0.2. The channel estimates are obtained 
using a training time of 10 channel uses. 



A. Multiuser Channel Model 

Consider a multiple access channel with L transmitters 
Ti where each transmitter has m transmit antennas and the 
receiver R has n receive antennas. The channel is constant 
during a fading block of Tcoh channel uses, but changes 
independently from one block to the next. 

The received signal can be written in the matrix form as 

{T,}^R:y- H,X, + W. (26) 

l<i<L 

Here, W is of size n x Tcoh represents additive white Gaussian 
noise at the receiver with all entries i.i.d. C7V(0, 1). We 
consider a Rayleigh fading environment, i.e. elements of Hi 
are assumed to be i.i.d CA^(0, 1). The transmitters are subject 
to an average power constraint such that the long-term power 
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Fig. 9. Diversity multiplexing tradeoff in various scenarios. The various 
parameters chosen are: m=l, n=2, r=0.5. The channel estimates are obtained 
using a training time of 10 channel uses. 



< SNR for 



is upper bounded, i.e, ;j^^trace(E 
l<i<L. 

The feedback path to the transmitter in an orthogonal 
frequency band is given by 



R ^ T,: : r, 



HfiX 



Wf, 



(27) 



where Hr 



link to 



I f,i is the MIMO fading channel for the feedback 
j*'' user, normalized much like the forward link. 
The feedback transmissions are also assumed to be power- 
limited, that is the reverse link has a power budget of 



-trace(E 



XtiX 



< SNR/. Without loss of generality, 
we will assume a symmetry in resources, such that SNR = 
SNR/. 

Finally, we assume that the receiver computes the common 
feedback indix Jr £ {0, • • • ,K — 1}, which is broadcast 
over the downlink and received at transmitter i as Jji G 
{O,--- ,K-1}. 

B. Diversity-Multiplexing Tradeoff 

The diversity multiplexing tradeoff for single user MIMO 
channels was described in Section II.C. Here, we extend 
that discussion to MIMO MAC channels. As before, we 
concentrate on single rate transmission. The dependence of 
rates on the SNRs is explicitly given by Rg = r^logSNRs. 
We refer to r = {rs)i<s<L as the multiplexing gains. Let 
H = {Hi, ■ ■ ■ , Hl^. Further, let the channel estimates at the 
receiver be H = {Hi, ■ ■ ■ , H^}. 

In a multiple access channel, corresponding outage event is 
defined as the union over the events that the channel cannot 
support target data rate for some subset of the users [28], union 
over all the subsets. Hence, for a multiple access channel with 
L users, each equipped with m transmit antennas, and a re- 
ceiver with n receive antennas, the outage event is O = IJ ©5. 

s 

The union is taken over all subsets C {1, 2, • • • , L}, and Os 
is the set of all the channels where the sum transmitted rate 



by these \S\ users is less than the maximum supportable rate 
by the MIMO link from these \S\ users to the destination. The 
system is said to have diversity order of d if n(0) ^ SNR^''. 
The diversity multiplexing tradeoff for the multiple access 
channel can be described as follows: given the multiplexing 
gains r for all the users, the diversity order that can be achieved 
describes the diversity-multiplexing tradeoff region. 

The probability of outage with rate R — (i?i, R2, ■ ■ ■ , Rl), 
transmit power P{Jjj) — PiVJji and perfect channel state 
information H at the receiver is denoted by S(R, P) = 



n C'5(R, P) J . If we assume that the receiver knows the 
channel perfectly as H, we denote the event [jOsiJ^,'P)) by 

[/h(R, P) where we assume that H is the perfect channel 
knowledge at the receiver. 

Let i:i(r,p) be defined as S(R,P) = SNR~^^'"'P^ where 
r = {ri,r2, ■ ■■ , r^) and p (pi,_P2, ■ ■ • ,Pl)- We further de- 
note function G{r,p) in Section II by Gm.„(r, p) to explicitly 
depict that this is for m transmit and n receive antennas. 

Lemma 5. [16] Let — p for all 1 < s < L. Also, 
let ^ Ti < minds'!™, n) for all non-empty subsets S of 

ies 

{1,2, • • • ,L}. Then, 



D{v, p) — minG 



\S\r 



E 



(28) 



C. CSIRTq: Estimated CSIR with Noisy Quantized Feedback 

All the results related to quantized feedback in this paper 
can be extended to multiple access channel where there is a 
feedback level sent from the receiver and all the transmitters 
receive this signal and adjust the power accordingly. To 
demonstrate the extension, we consider a symmetric system 
where all transmitters have a statistically identical channel to 
the receiver with identical average SNR and employ the same 
power control thresholds. Furthermore, we will only consider 
a single-bit feedback, which implies that simultaneously all 
transmitters will be instructed to use the low power level or 
the high power level. However, since we assume independent 
errors in the feedback links, each transmitter may or may not 
transmit at the right power level. Under the above conditions, 
an achievable diversity-multiplexing tradeoff is given by 

Theorem 8. For K = 2 and r = (ri, • • • , r^) with J2 ''"i — 

les 

min(|5|m, n) for all non-empty subsets S of {1, 2, • • • , L}, 
an achievable diversity-multiplexing tradeoff for a multiple- 
access channel is given by d-^ (r, 2) = D{r, 1(1 + D{r, 1))) 
where Ix denotes ( 

Proof We will provide the main steps to prove the above 
result based on the following three-phase protocol. First define 
po = and pj = D{r, 1(1 +pj_i))Vj > 1. 
Phase 1: Each transmitter trains the receiver using power 
SNR to get the channel estimate Hi at the receiver. On 
the basis of this training, the receiver decides a feedback 
level Jr in the following way. We consider two power levels 
Pi = SNR^+^Si e [0, 1]. We will state the exact constants for 
power control while proving that the average power constraint 
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will be satisfied. The feedback index is 

J = / 1 if C/H(R+lelog(SNR),lPo) = 1 
otherwise 

Intuitively, we choose the higher of the two power levels if 
the lower power level is not sufficient to avoid outage (even 
for one of the users) based on the estimated channel. 
Phase 2: A feedback level Jr G {0, 1} is sent from the re- 
ceiver but each transmitter receives Jji S {0, 1} according to 
the power controlled feedback scheme in Section IV-B. Since 
the feedback links have i.i.d. errors, different transmitters may 
receive different feedback indices. 

Phase 3: The transmitter k gets a feedback power level Jj^, 
sends a training signal to the receiver at power level Pjj^ 
followed by data at power level Pjj^. The channel estimate 
based on this power-controlled training is denoted H2.k- Let 
Hk = Hk- H2.,k- Further, H2 = {H2,ir-- ,^2x1. Also, 
denote as the \S\m'x n matrix formed by concatenation 
of Hi in S. Similarly, define H2S and Hg. 

The outage probability is bounded from above by sum of 
outage probabilities for each transmitter. The analysis of Phase 
3 is similar to Appendix C-B 1 since the estimation error in the 
third phase can be neglected for diversity multiplexing tradeoff 
purposes. Now define J as 

J = / 1 if t^fiJR' 1^0) = 1 and t/g^(R, IPi) = 
\ otherwise 

Since the third phase estimation error can be neglected, the 
outage probability is 

n(0) = n(c/a^(R,p(jT)) = i). 

Hence, repeating the analysis of single user systems and using 
union bounds in Section VI, we get the same results as in 
single user systems, but with G replaced by D and single 
multiplexing gain replaced by multiplexing gain vector. 

We show by example how to extend all the steps. n( Jr = 
1, J = 0) can be written as 

n(jR = o,j = i) 

= n I log det(/ + SNRHsH^g) > ^ RiMS and 

logdet(7 + SNRHssHJs) <^Riior some S 

ies 

and logdet(/ + PiH2sHJs) > ^i?iV5 j (29) 

We will now bound the above statement as follows. First define 
the following events 

As = {log det(7 + SNRHsHt;) >^i?i}, (30) 

Cs = {logdet(/ + SNRH2sHL)<E^'}' ^^l) 
Ds = {logdet(7 + SNR^+fiH2sH^s)>5^i?i}(32) 

We note (29) is the probabiUty of HsAs n (UgCs) n 



r^sDs C Us{As n Cs). Hence, 

n( Jr = 0, J = 1) 

< [ logdet(J-hSNRHsH^) > ^iJ, and 

s \ ies 

logdet(7 + SNRH2sHL) <J2^'] ^^^^ 

Note that this term is similar to that in single user, and for 
each S is = 0. Thus, the above probability = 0. 

Similarly, all other steps for single user MIMO channels 
can be extended to MIMO MAC system. Hence, the diversity 
order of -D(r, 1(1 + £'(r, 1))) can be achieved with imperfect 
feedback and imperfect CSIR for L transmitters. ■ 

In [28], diversity-multiplexing for multiple-access case was 
considered without feedback. The achievable diversity multi- 
plexing without feedback is -D(r, 1). In [14], it was shown that 
with 1 bit of perfect feedback, the diversity multiplexing of 
Z)(r, l(l + Z)(r, 1))) can be achieved when the receiver knows 
perfect channel state information. In this paper, we show that 
the diversity multiplexing tradeoff of I?(r, 1(1 + -D(r, 1))) 
can be achieved even when the receiver is trained on a noisy 
channel and the feedback index is also sent on an orthogonal 
noisy channel with power-controUed training and feedback. 

IX. Conclusions 

In this paper, we find the diversity tradeoff for a non- 
symmetric FDD system in which the errors in MMSE channel 
estimate and the quantized feedback channel are accounted 
for a single user and a multiple access channel. We find 
that diversity multiplexing tradeoff of a system with 1 bit of 
feedback over a noisy channel and MMSE channel estimate 
at the receiver is the same as that of a system with 1 bit of 
perfect feedback and perfect channel estimate at the receiver. 
More importantly, we show that additional bits of feedback 
do not increase the diversity order of the system at constant 
rates. 

The approach in this paper has also been used to improve the 
performance of a TDD system as is summarized in [24]. The 
two models, FDD and TDD, consider the two extreme cases 
of the correlations between the forward and the backward 
channel. As a next step, one can consider what happens if 
the forward and the feedback channel are correlated, but not 
exactly the same. 

This paper suggests that one round of training provides 
a certain resolution to the channel gain which Umits the 
diversity multiplexing tradeoff performance. The strategies can 
be extended to a multi-round communication between the 
sender and the receiver that allows better channel resolution 
at the nodes. This multi-round extension for both FDD and 
TDD models can be seen in [32]. 

Also, this paper assumes a Rayleigh fading channel model. 
The authors of [29] consider a general model for fading which 
includes Rayleigh, Rician, Nakagami and Weibull distributions 
to find the diversity multiplexing tradeoff for a system with 
no feedback and perfect channel estimate at the receiver. The 
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extension of the feedback cases to general fading models is 
still open. 

Finally, the two way channel model can be extended to 
consider delays in the feedback channel. If there is a delay 
in the feedback process, the transmitter can decide to send 
some data as if there is no feedback till it receives feedback 
and then try to use the feedback to improve the diversity by 
sending power controlled data (possibly correlated with the 
data transmitted before feedback is received) in the remaining 
time. 
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Appendix A 
Proof of Theorem 4 

We first note some properties of I„{x), modified Bessel 
function of first kind, that will be used in the proof. The series 
expansion of Iv{x) is given as [31, Equation 9.6.10][30], 



1 



1=0 ^ ' 



(i) 



(34) 



When |a;| is large and |arg(x)| < | , asymptotic expansion 
of Iv{x) is given by [31, Equation 9.7.1] 

(/x-1)(m-9) 



1 



1 



+ 



+ • 



(35) 



/27rx L X 1\{%xY 

(M-1)(a^-9)(m- 25) 
3!(83;)3 

where \x = 4j/^. 

Now, for the proof of Theorem 4, we will use Lemma 4. We 
will further suppose that rrifq = m without loss of generality. 
Let A = \i : ai+OLi < 2}. We will evaluate p(a, a) in the 
following five disjoint cases which comprise the whole space 
of possibilities. 

1) niin(Q!„i,am) > 1 ( or (a, a) G Ey^. 

2) min(am, < 1, a™ + > 2. 

3) min(aTO,am) < 1, + < 2, Ui ^ oti for some 
i € A. 

4) mhi{am,cim) < 1, ctm + Oim < 2, at = cxi for all 
i€A, (a,a)^UL"o'W 

5) (a,a)eUr=o'(^^^)- 

Now, we consider all of the cases one by one as follows. 
1) min(aTO,am) > 1 : Using Equation (9), 



exp 



(- 



A(A)A(A) 



m\mmjS^^j\{j + u)\\p\ 
n^=i(VAX)"det 



Since p = , ^ , we substitute p = 1 and 1 — p^ = 
1/SNR to obtain 



p(a, a) = exp ( — 



1/SNR / 

A(SNR-")A(SNR-")SNR-^-i("'+"-) 



det 



2-\/SNR-°fcSNR- 
1/SNR 



m!m!n™-o^j!(j + n-m)!sMk 
As A(SNR-") = SNR-^^i('-^'"\ we get 

p{a,a) = SNR™SNR-^£i('+'^)(«'+"*) 



(36) 



We will now find det 
we get 



In-m (SNRl-^) 

/.(SNRi-^) 



• (37) 



. Using (34), 



/.(SNR-^) 



= E 



'SNR^" 



2i+v 



s-^ I I SNR 2 

i\{i + v)\ 



1=0 



(38) 



Let per(fci, ^2, • • • , km) for (fci, k2,--- , km) a permu- 
tation of (1, • • • , m) be defined as follows 

per{ki,k2,- ■ ■ ,k,n) 

if (fci, fc2, • • • , fcm) is an even 
permutation of (1, • • • , m) 

1 if (fci, fc2, • • • ) km) is an odd 
permutation of (1, • • • , m) 



• (39) 



Thus, 



det U, 



. (SNRi- 



SNRi- 



1 / SNR^ '- 



k, 



i=0 



S|\|R''E£i(l-^T^) y^(_l^Per(fci,fc2,--- ,fem) 



2i 



2 — ttfc, +5; 



SNR''""'"' 



(40) 



The above equation is same as Equation (56) in [33] 
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with K = 
and hence 



V2-1 



det 



( SNRi- 



SNR 



l-Cti 



and A,: 



SNR 



l-Si 



= SNR''5:r=i(i-^)sNR2^-i(i-i)(i-^) 

= SNR^"-(''-'+''K^-^). (41) 

Substituting in Equation (37), we get 

p{a,a) = SNR'"SNR-^£i('+'^)("'+"^) 
SNREr=i(2-2+.)(i-^) 

= SNR'""SNR"^'^i*^*+^^^'^"*+"'^ 

= SNR™"SNR~ ^*=i^^'+"~"*~^^^"""'""*^ 
= em. (42) 

2) min(a„i,am) < 1, + Sm > 2: Using 

Equation (36), we see that all tenns except 



E"iSNR- 



remain 



the 



same 



^^P \^ l/SNR 

and hence are polynomial in SNR while 
exp ( — ^•=^ ^'^wsNR^^^'^ — ") decreases exponentially 



l/SNR J 

in SNR and hence p{a, a) = 0. 
3) min(am,am) < 1^ ctm + o^m < "2, ai ^ a,; for some 
i & A: We will prove p{a, S) = in this case. For this 



we consider exp (-^%rp2^) det 



I. 



2|p|VAfcA, 



and prove this part to decrease exponentially and we 
would be done since rest of the terms are polynomial in 
SNR. Using (35), 



det 



2|p|VAfcA; 



exp 



2|P|/ 
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l-p2 



ni^i,„,,+s,>2Poly(SNR)), (43) 

where Poly(SNR) represents the term that are polyno- 
mial in SNR. First observe that 



exp 



Afcj + A; 
1-^2 



exp 



2|plVAfc,A; 
l-p2 



< 1, 



(44) 



and hence the product of such terms cannot increase 
exponentially with SNR. 



2\p\\/Xk\i 



-L. y^^_]^^per(fci,fc2,--- ,fcm) 
k 

nr=iexp(-SNRi-™"("'=''"')) 
nr=i,a„+3,<2exp(SNRi-^)Poly(SNR) 



n: 



= l,afc, +3i<2 



,>2exp(-SNR^-™("'=!'"')) 
exp(-SNRi-™("'=!'"'') 



exp(SNR^" 



') J Poly(SNR) 



< 



per(fci ,k2,--- ,km) 



E(-i) 



-^4=l,afej+ai>2-^inin(a;fej,a,)>l 



■rrm 
-'■J-i=l,. 



,^=a,Poly(SNR) 



(45) 



Now, we will show that each term under the sum decays 
exponentially with SNR. For this, the product of the 
above indicators should be 0. Let us now consider all 
the scenarios when the above product of indicators is 1. 
If some ai < 1, and pairs with a^, to give + afe, < 
2, then the two must be equal and if it pairs to give 
(ii + Oiki > 2, the product of indicators will always be 
0. If S; > 1 and pairs with to give a; + < 2, it 
can only happen when the two are equal and if it pairs 
to give Q/ + fi/,, > 2, then Ofe, > 1. Hence the only 
pairing that will work is that all elements of ai < 1 are 
matched to a] < 1 and also > 1 is not mapped to 
Sj < 1. This can happen only when ai = cti whenever 
aj < 1 or Sj < 1 and all the rest are > 1. If ccj ^ Sj 
for some i G A, the above condition do not hold. This 
proves Case 3. 

4) min{am,cim) < 1, ctm + cim < 2, aj = Sj for all 
i G A, {a, a) ^ Ufc^^ i^k)' We see that all the analysis 
of Case 3 holds for this case and hence if a, = Sj for 
all i e A and {a, a) ^ Ufcl^j^C-^fc)' ^ element > 1 
is mapped to an element < 1 which makes the product 
of indicators zero and hence the probabiUty decreases 
exponentially with SNR. 

5) a, a e UA~o^(-^fc)- These are the cases we sum 
over in the statement of the Lemma. In each of 
these cases, p{a,a) exist. When we integrate over 
a, we find that integral of X^fe^felfifc ^-r-t- is 

nm CMp-(2i-l+n-m)ai-, -, 
j=l-5lMK iai,... ,afe>lio<afc + i,--- ,tt™„<l- 

We next note that in order to find the polynomial 
expressions associated, the two exponentials 
multiplication cannot decrease with SNR for which we 
would need A;; = Z in (43) (since all the A's and A's 
are ordered) and hence from all the expressions, we 
find that p{cx, a) = pi{cx)p2{6i.) for any Ek. Using the 
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separability, we find that 

_ 5|yjpfe(n-TO+fc)jjfe^^5|y|p-(2i-l+|n-m|)ai 

n^'^SNR"^^*"^"'"'""'"'^"*. (46) 



Appendix B 
Proof of Theorem 3 



In this appendix, we prove that the diversity in the statement 
of Theorem 3 can be achieved. Let S(-R, P) be the probability 
of outage when the transmitter uses power level P and rate R 
is required. If P = SNR^, then S{R,P) = SUR'^^'^'P^ and 

U{0)<S{R, Pk-i) + ni-^T <i,Jr= i). 

i=l 

We assign the feedback index Jr at the receiver as 

'argminjgii, l = {k: logdct(/ + HH^Pk) > R, 
Jr=< ke{0,--- ,K -1}} 

K — 1, if the set I is empty 

We will now find n( Jj = i) for i > 1 as 

n(JT = 

K-l 



= 5^n(jT = i,jR = i) 

3=0 

K-l 

= J2U{JT=i\jR=j)U{jR=j) 



i=o 

n(jR 
n(jR 



K-l 

i)+ n(jT = i|jR = i)n(jR = j) 

i). (47) 



The above steps foUow from the fact that 11 (Jj = i| Jr 
j) = if j < i. Further, 

J2 = i\Jr = j)u{Jr = j) 
j=i+i 

K-l 

< Yl n(jR = j)<n(jR = z). 

Let the power levels be chosen as 



(48) 



^ when i = 

SNR 



when i > ' 



Further, Qi < Pi. We first note that the power constraints are 
satisfied. Note that as before, P, = SNR^+^'* and Qi = SNR«*. 
Also, Pi = SNR1+*^^(''''V« and S{R, P,) = SNR-'^''^''(''''+^\ 

Thus, n(jR = ?:) = n(jT = i) = snr^*^"'''^'). 

We will now show that using Pj as given above, we achieve 
the desired diversity multiplexing tradeoff using the following 
computation. 



n(0) 

< S(P, Pk-i) + Y n(^T <i,JR = i) 



K-l 



i=l 
K-l 



SNR-*^'^^''"^) + Y n(JT < i\JR = iMJr = i) 



i=l 
K-l 



= SNR-*^^^''"^) + Y n(^T < i\JR = i)SNR-*^o 



i=l 



K-l 



_|_ 5|\jp-mn(max(gi,0)-max(gi_i,0))g|yjp-dRTg(r,i) 



= SNR"''''^'!^''''^^ 



(49) 



Appendix C 
Proof of Theorem 6 



We will first show that the diversity cannot be greater than 
mn{mn +1), and then show that diversity of G{r,l + G{r, 1) 
can be achieved with 1 bit of feedback. 



A. Converse 

In this subsection, we will prove that we cannot get more 
diversity with > 2 levels of feedback with imperfect CSIR 
as compared to 1 bit of feedback with perfect CSIR at zero 
multiplexing. 

For this, we assume that the third phase is perfect and thus 

H = 0, and H2 = H. Thus, 

n(0) = n(logdet(/ + P(jR)ifif^) < R). 
Define J as 

' arg minjgi i, I = {fc : log dct(/ + HH^Pk) > R, 
J=< k€{0,--- ,K-1}} 

0, if the set I is empty 

The probability of outage is then 

n(C») = n(logdet(/ + P(jR)iIift) < P) 

> n(logdet(7 + P{Jr)HH^) <R,Jr = 1) 
= n(logdet(/ + PiHH^) < R, 

logdet{I + PoHW) < P + elog(SNR), 

logdet(/ + Pii?i7^) > 

P + elog(SNR)). (50) 

Let Xj and be the eigenvalues of HW and HW 
respectively. Further let aj and aj be the negative SNR 
exponents of the corresponding eigenvalues. Then, Equation 
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(50) reduces to 

n(0) > n(logdet(7 + PiiJiJ1') < i?, 

logdet{I + PaHH^) < i? + elog(SNR), 
logdet{I + PiHH^) > i? + elog(SNR)) 
> n(n^"Li(l + AjSNRi+fi) < SNR'' and 
n7Li(l + AjSNR) < SNR''+' and 
njLi(l + AjSNR^+Pi) > SNR'^+'^) 

= n(y~^(l +pi - aj)+ < r and 

m 

— Sj)+ < r + e and 



(51) 



^(l+pi-a,)+ >r + e). 
For r — )• 0, the above reduces to 

m 

n(0) > n(^(l+pi -aj)+ < and 
i=i 

m 

^(1 - aj)+ < e and 

m 

5^(l+pi-a,)+>e). 

i=i 

Thus, the choice of aj = 1 + pi and all aj = 1 for e — >^ 
can be used for the outer bound on outage probabihty. This 
gives n(e')>SNR-'""(^'i+^\ As pi < mn, the diversity < 
mn{mn + 1). 



(52) 



B. Achievability 



In this subsection, we will show that the diversity gain of 
G(r, 1 + G'(r, 1)) can be achieved with 1 bit of feedback with 
imperfect CSIR. We will first prove that the interference error 
due to -ff in third phase do not make a difference and thus 
can be removed. 

1) Analysis of Phase 3: Let the actual channel be H while 
H be the error in the estimate of H in the third phase of 
training which is power controlled. As a result, we obtain 



n(0) 



n(o, Jt = j) 



E 

K-1 

^ n(logdet(l + - 

3=0 ^ 

Jt=3). 



< R, 



Consider any term in the sum above, the receivei^is trained 
with power Pj, and hence let eigen- values of P, if if t be = 



SNR Let a = (ai, • • • , am) Then, 
U{0) 

K-l 

= ^n(logdet(l + 



1+P,trace(iifft)' 

K-l 

= J2 n(logdet(l + Pj.&s^lSNR-^-™")"') < R, 

3=0 

Jt = j)- 

Now, since H2 and H are uncorrected, probability that 
(— mina)+ > decreases higher than polynomial in SNR 
and hence, 

K-l 

n(0) = ^n(logdet(l + Pjff2^2^SNR-(-™°)"') 

3=0 

<R,Jj= j) 

K-l 

= n(log det(l + PjHiHl) <R,Jt= j) 

3=0 



= n(logdet(l + P{Jt)H2HI) < R). 



(53) 



Thus, 



U{0) = n(logdct(/ + P{Jj)H2HI) < R) 

Note that the correlations between H2 and H are same as 
between H and H and thus, there is no difference in using 
H in place of H2 for the purpose of calculating the diversity 
multiplexing tradeoff. We will now only focus on isT = 2. 

2) Analysis of Phase 1: In this section, we see how the 
feedback error in first phase decays with SNR. 



Jr 



1 if logdet(/ + HH'^Po) < R + elog(SNR) 
if logdet(i + HH^Po) > R + elog(SNR) 

Let us define J as 

r 1 if logdct(/ + H2HIP0) < R and 
J = I log det(/ + H2HIP1) > R 
( otherwise 

The probabihty of outage is then 

U{0) 

= n(log det(7 + P{Jt)H2HI) < R) 

= n(logdet(7 + P{Jj)H2HI) < R,J = Q,Jr = 0) 

+n(logdet(/ + P{Jt)H2HI) < R, Jr = 1) 
+n(logdet(/ + P{Jj)H2HI) < i?, j = 1, Jr = 0) 
< n(log det(/ + P1H2HI) <R,Jr = 0) 

+n(logdet(/ + P1H2HI) <R,Jr = 1) 

+n( Jr = 0, j = 1) 

= n(log det(7 + P1H2HI) < R) 

+n(jR = 0,J=l). (54) 
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Denote eigenvalues of HW by and the negative SNR 
exponents of Aj as Sj. Also denote eigenvalues of H2H2 by 
Aj and the negative SNR exponents of A^ as a^. 
Then, n( Jr = 0, J = 1) can be bounded as 

n(jR = o,j = i) 

= n (log det(/ + ^2^2^SNR) < E and 

logdet(7 + ^2^|SNR^+Pi) > a^d 
logdet(7 + ^^tsNR) > i? + elog(SNR)) 

= n(n™i(l + AiSNR) < SNR*^ and 

n™ 1(1 + A,SNRi+f^) > SNR'' and 
n™i(l + AiSNR) > SNR^'+^j 



< n 1^(1- < rand 
\i=i 

m 

^(1 +pi - > r and 



= 0. 



(55) 



The last step follows since expanding in terms of Ek gives 
— = — for all cases of non 

exponentially decreasing probability. 

We now show that the diversity order of G{r, 1 + G{r, 1)) 
can be achieved. From Equation (54), the probability of outage 
for X = 2 is 



n(o) 



< 



n(iogdct(/ + P1H2HI) < R) 
+n( Jr = 0, j = 1) 

< SNR-'^('''^+Pi\ 



(56) 



Thus, diversity order of G{r,l + G(r, 1)) can be achieved 
which is the optimal diversity order for perfect training in 
[13]. 

Lastly, we show that the power constraint can be satisfied 
with the above choice of powers. 

Lemma 6. Power constraint is satisfied for K = 2. 

Proof: Let Po = ^ and P, = 
The power constraint is Poli{Jj = 0) + Pin(JT = 1) < 
SNR, which trivially holds. ■ 
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